ABSTRACT

Large patterns could exist on the microwave sky as a result of various non-standard 
possibilities for the large-scale Universe - rotation or shear, non-trivial topology, and 
single topological defects are specific examples. All-sky (or nearly all-sky) CMB data 
sets allow us, uniquely, to constrain such exotica, and it is therefore worthwhile to 
explore a wide range of statistical tests. We describe one such statistic here, which 
is based on determining gradients and is useful for assessing the level of 'preferred 
directionality' or 'stripiness' in the map. This method is more general than other 
techniques for picking out specific patterns on the sky, and it also has the advantage 
of being easily calculable for the mega-pixel maps which will soon be available. For 
the purposes of illustration, we apply this statistic to the four-year COBE DMR data. 
For future CMB maps we expect this to be a useful statistical test of the large scale 
structure of the Universe. In principle, the same statistic could also be applied to sky 
maps at other wavelengths, to CMB polarisation maps, and to catalogues of discrete 
objects. It may also be useful as a means of checking for residual directionality (e.g., 
from Galactic or ecliptic signals) in maps. 

Key words: cosmic microwave background - methods: numerical - cosmology: ob- 
servations - cosmology: theory - large-scale structure 



1 INTRODUCTION 

The desire to understand the nature of the physical universe 
has driven scientific inquiry throughout history. An impor- 
tant aspect of this has been the search for models which 
accurately describe the structure of the Universe on the 
very largest scales. Only in the last decade have we had the 
means to access information from the largest of the scales 
that are observationally within reach, namely those scales 
corresponding to the light-crossing time in the age of the 
Universe. 

In the absence of a unique mathematical description 
for the structure of space, the only way to pin down its 
large-scale behaviour is through direct observation of fea- 
tures which probe the largest scales. Anisotropies in the 
Cosmic Microwave Background (CMB) give us just such a 
diagnostic tool. Furthermore, since the detection of large- 
angle anisotropies by the COBE DMR experiment (Smoot 
et al. 1992) it is now clear that this is a tool that can be used 
in practice. Although the COBE detection was impressive,
the limited angular resolution and sensitivity make the data
set far from definitive. Future CMB experiments, and in par-
ticular the Microwave Anisotropy Probe (MAP) and Planck
satellites, will provide data that will be far more constraining
for the structure of the Universe.

The traditional view is that most of the information 
content in the CMB sky is contained in the angular power 
spectrum of anisotropies. As a result, a great deal of ef- 
fort has gone into obtaining the best estimate of the power 
spectrum from the COBE DMR data (Bennett et al. 1996; 
Gorski et al. 1996; White & Bunn 1995; Bond 1996), as well 
as for other experiments. However, it is certainly possible 
that the CMB sky exhibits some large-scale pattern, not 
discernible from the power spectrum, which would indicate 
some non-trivial large-scale structure. Since the endeavour 
of examining the Universe on its largest accessible scales is 
such a grand one, it is certainly worthwhile to explore all 
statistics which might be used to constrain such patterns. 

Examples of specific such patterns have already been 
investigated, although we would like to stress that the cur- 
rent list is likely to be far from exhaustive, and the best 
examining the consequences of non-trivial topology, with a
repetition scale close to the Hubble radius today (Stevens 
et al. 1993; de Oliveira-Costa et al. 1996; Levin et al. 1997; 
Cornish et al. 1998; Souradeep et al. 1998). One manifestation
of a short periodicity scale in two directions is that the
sky would appear to be composed of concentric 'stripes',
superimposed on the background of fluctuations (Starobinsky
1993). Another class of possibilities are universes with ro- 
tation or shear, which define a preferred direction on the 
sky around which there can be special patterns (Barrow, 
Juszkiewicz, & Sonoda 1985); specific Bianchi models have 
been investigated in detail (Bunn et al. 1996; Kogut et al. 
1997). A further example is the possibility that single large- 
scale topological defects could give rise to particular pat- 
terns, e.g., a step-like discontinuity caused by the largest 
cosmic string (Perivolaropoulos 1998), or a localised pattern 
caused by the last texture to unwind (Magueijo 1995). It is 
also possible that properties of particle fields in the Universe 
could create electromagnetic effects yielding special direc- 
tions (see e.g. Carroll & Field 1997 for a recent discussion). 
These examples represent the cases already studied, where 
there are known patterns which exhibit some directionality. 
We wish to consider the idea of searching for general phe- 
nomena which might give rise to a preferred direction on 
the sky, with no particular prejudice about the nature of 
the pattern. 

The closely related general topic of non-Gaussianity, 
i.e. the correlation between phases of the modes, has also 
received a great deal of recent attention (Ferreira et al. 1998; 
Wandelt et al. 1998; Lewin et al. 1999; Pando et al. 1998; 
Hobson et al. 1998; Novikov et al. 1999; Heavens 1998). 
Many of these studies have made use of statistics calculated 
in the Fourier domain. However, there are significant ad- 
vantages in developing statistics which can be calculated di- 
rectly from pixel values, and which are designed to search for 
specific forms of non-Gaussianity. Of course non-Gaussianity 
could take many forms, each requiring a differently tailored 
statistical test. We consider our specific statistic to be part 
of the arsenal of statistics for assessing the large-scale struc- 
ture of space-time which might involve a preferred direction 
on the CMB sky. Indeed we would claim it is the obvious 
statistic to use for this purpose, as we describe in the next 
section. 



2 A PREFERRED-DIRECTION STATISTIC 

We want to devise a statistic that will tell us whether a sky 
map like that obtained from COBE has a preferred direc- 
tion. In particular, we would like to determine whether tem- 
perature contours on an all-sky map tend to be either elon- 
gated or squashed along some particular direction in space. 
To pick out such a direction, it seems most natural to
work with the gradient $\nabla T$ of the temperature map. In a
noisy map, of course, some amount of smoothing will be nec-
essary before $\nabla T$ is estimated. In particular, suppose that
we have a sky map with $N_p$ pixels. We will estimate the
gradient of the map by grouping the pixels into $N_b$ localised
blocks and fitting a linear function to the pixels in each
block. In this manner, we obtain a gradient vector $\mathbf{g}_p$ for
each block. By varying the number of blocks $N_b$ (and hence the size of
each block), we can control the amount of smoothing. The
appropriate amount of smoothing depends on the resolution
and the noise level of our data set and must be determined
by trial and error on mock data sets.

The precise method for dividing the $N_p$ pixels into $N_b$
blocks will depend on the exact pixelisation scheme. As we
will see below, the COBE sky cube pixelisation leads to
a natural method for grouping pixels: we simply subdivide
each face of the cube into squares.
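The block-wise gradient estimate described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code used for the analysis in this paper: it assumes NumPy, the function names `tangent_basis` and `block_gradient` are hypothetical, and the local coordinates are simple projections on to the tangent plane at the block centroid (adequate only for blocks that are small compared with the sphere).

```python
import numpy as np

def tangent_basis(r_hat):
    """Return two orthonormal vectors spanning the plane perpendicular to r_hat."""
    # Use the coordinate axis least aligned with r_hat to avoid a degenerate cross product.
    helper = np.eye(3)[np.argmin(np.abs(r_hat))]
    e1 = np.cross(r_hat, helper)
    e1 /= np.linalg.norm(e1)
    e2 = np.cross(r_hat, e1)
    return e1, e2

def block_gradient(pix_dirs, temps):
    """Fit T = a + b*x + c*y over one block and return (r_hat, g).

    pix_dirs : (M, 3) unit vectors towards the pixels of the block
    temps    : (M,) pixel temperatures
    """
    # Centroid direction of the block, normalised to unit length.
    r_hat = pix_dirs.mean(axis=0)
    r_hat /= np.linalg.norm(r_hat)
    e1, e2 = tangent_basis(r_hat)
    # Design matrix: constant term plus the two tangent-plane coordinates.
    design = np.column_stack([np.ones(len(temps)), pix_dirs @ e1, pix_dirs @ e2])
    coeffs, *_ = np.linalg.lstsq(design, temps, rcond=None)
    # The fitted slopes give a gradient vector that is tangent to the sphere.
    g = coeffs[1] * e1 + coeffs[2] * e2
    return r_hat, g
```

By construction the returned vector is perpendicular to the centroid direction, which is the tangency requirement imposed on each gradient estimate.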

Let us suppose we have estimated $\nabla T$ at each of $N_b$
locations. Let $\mathbf{g}_p$ be our estimate of the gradient at loca-
tion $p$. Naturally, $\mathbf{g}_p$ must be tangent to the surface of the
celestial sphere at that location. If $\hat{r}_p$ is a unit vector point-
ing toward the location of gradient estimate $p$ (specifically,
say the centroid of the pixels that were used to make that
estimate), then $\mathbf{g}_p$ must be perpendicular to $\hat{r}_p$, i.e.,

$$\hat{r}_p \cdot \mathbf{g}_p = 0. \qquad (1)$$

We assume that we have estimated the gradient at $N_b$ lo-
cations, so that our data set consists of the $N_b$ vectors
$\mathbf{g}_1, \ldots, \mathbf{g}_{N_b}$. If there is a preferred direction in the map, then
the $\mathbf{g}_p$ should tend either to line up with some particular direc-
tion or to avoid some particular direction. To see whether
or not this happens, we should compute



$$f(\hat{n}) = \sum_{p=1}^{N_b} w_p\, (\mathbf{g}_p \cdot \hat{n})^2 \qquad (2)$$

for all unit vectors $\hat{n}$. The weights $w_p$ should be chosen to
avoid introducing spurious signals into the statistic; we will
describe how to do this below. If our map has a preferred
direction $\hat{n}$, then $f(\hat{n})$ should be either anomalously small
or anomalously large compared to other directions. That is,
a large value of the 'directionality' ratio

$$\mathcal{D} = \frac{\max_{\hat{n}} f(\hat{n})}{\min_{\hat{n}} f(\hat{n})} \qquad (3)$$

will alert us to the presence of a preferred direction.
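Equations (2) and (3) can be evaluated directly by brute force, which is a useful cross-check even though a much faster route exists. The sketch below is illustrative only: it assumes NumPy, samples trial directions on a Fibonacci lattice (our choice for the illustration, not part of the method), and uses hypothetical function names.

```python
import numpy as np

def fibonacci_sphere(n_dirs):
    """Roughly uniform set of unit vectors on the sphere (illustrative sampling)."""
    i = np.arange(n_dirs) + 0.5
    z = 1.0 - 2.0 * i / n_dirs
    phi = np.pi * (1.0 + np.sqrt(5.0)) * i
    s = np.sqrt(1.0 - z**2)
    return np.column_stack([s * np.cos(phi), s * np.sin(phi), z])

def f_of_n(n_hats, grads, weights):
    """Equation (2): f(n) = sum_p w_p (g_p . n)^2, one value per row of n_hats."""
    proj = n_hats @ grads.T            # shape (n_dirs, N_b)
    return (proj**2) @ weights

def directionality_brute_force(grads, weights, n_dirs=20000):
    """Equation (3), approximated by the max/min of f over the sampled directions."""
    f = f_of_n(fibonacci_sphere(n_dirs), grads, weights)
    return f.max() / f.min()
```

Because the maximum and minimum are taken over a finite set of directions, this estimate can only approach the true ratio from below.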

Of course, the ratio of maximum to minimum values is
not the only statistic one could derive from the function $f$,
although, since $f$ is a very simple function (a quadratic in
$\hat{n}$), the number of essentially distinct statistics that can be
derived from it is limited. We choose this particular statis-
tic because it is simple, and because it picks out maps for
which $f$ has either extremely high peaks or extremely deep
valleys. A preferred direction in a sky map could manifest
itself either through very large or very small values of $f$:
for instance, a model that is topologically small in one di-
mension will produce a very low value for $f$ when $\hat{n}$ points
along the preferred direction, while one that is small in two
dimensions will produce a high value. Both sorts of maps
are detected by the statistic $\mathcal{D}$.

A major advantage of this statistic over other methods
of finding preferred directions is that it is extremely easy
to compute. The function $f$ is quadratic in $\hat{n}$, which means that finding
its extrema for a given sky map involves nothing more than
solving a 3-dimensional eigenvalue problem. We will work
this out in detail below, but first let us consider how to
choose the weights $w_p$.
we have non-uniform sky coverage, requiring non-uniform
weights. To see why uniform weights are unacceptable, con-
sider the standard COBE situation where a region near the
equator has been cut out. That means that the pixel direc-
tions $\hat{r}_p$ will have $z$ components whose absolute values are
larger, on average, than their $x$ and $y$ components (these
components are of course specified with reference to an or-
dinary Cartesian coordinate system with the $z$-axis perpen-
dicular to the equatorial plane). According to equation (1),
that means that the $\mathbf{g}_p$ will have $z$ components that are closer
to zero than they would be if we had full sky coverage. So
$f(\hat{z})$ will be smaller than $f(\hat{x})$ or $f(\hat{y})$. That is dangerous,
because it means that the $z$ direction will look like a pre-
ferred direction even when it is not. The solution is to use
non-uniform pixel weights; in this particular case, we would
give the pixels near the equator higher weights than those
near the poles.

We must now work out a prescription for assigning pixel
weights that will avoid 'false positive' results of the sort de-
scribed above. Consider the null hypothesis that there is in
fact no preferred direction. Then each $\mathbf{g}_p$ is a random vector
with an isotropic distribution in the tangent plane to the
sphere at that point. That is, $\mathbf{g}_p$ must be in the plane per-
pendicular to $\hat{r}_p$, but within that plane it is equally likely
to point in any direction (for standard Gaussian theories,
$\mathbf{g}_p$ will be a two-dimensional Gaussian random vector, but
for the moment we do not need to assume that). We want
to choose the weights $w_p$ so that the function $f$ has no par-
ticular preferred direction in this case. Specifically, we re-
quire that the ensemble-average quantity $\langle f \rangle$ be constant as
a function of $\hat{n}$.

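We can write equation (2) as

$$f(\hat{n}) = \hat{n} \cdot A\, \hat{n}, \qquad (4)$$

where the $3 \times 3$ matrix $A$ has elements

$$A_{ij} = \sum_{p=1}^{N_b} w_p\, g_{pi}\, g_{pj}, \qquad (5)$$

and $g_{pi}$ is the $i$th Cartesian coordinate of $\mathbf{g}_p$. Then $\langle f(\hat{n}) \rangle$
will be independent of $\hat{n}$ if and only if $\langle A \rangle$ is proportional to
the identity matrix. We have not decided how to normalise
$f$ yet, so we may as well use that freedom to demand that
it be equal to the identity:

$$\langle A_{ij} \rangle = \delta_{ij}. \qquad (6)$$

Since

$$\langle A_{ij} \rangle = \sum_{p=1}^{N_b} w_p\, \langle g_{pi}\, g_{pj} \rangle, \qquad (7)$$

equation (6) constrains the possible choices of weights $w_p$.
To cast this constraint in a useful form, we have to make use
of the assumed isotropy of $\mathbf{g}_p$. Let $\mathbf{G}_p$ be a 3-dimensional
isotropic random vector, and choose $\mathbf{g}_p$ to be its projection
on to the tangent plane:

$$\mathbf{g}_p = \mathbf{G}_p - (\mathbf{G}_p \cdot \hat{r}_p)\, \hat{r}_p \qquad (8)$$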

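- this is just a simple and convenient way to impose the
requirement that $\mathbf{g}_p$ be isotropic in the tangent plane. Since
$\mathbf{G}_p$ is isotropic, its second moments take the form
$\langle G_{pi} G_{pj} \rangle = P_p\, \delta_{ij}$ for some constant $P_p$.
Then application of equation (8) yields

$$\begin{aligned}
\langle g_{pi}\, g_{pj} \rangle &= \langle G_{pi} G_{pj} \rangle
 - \sum_{\alpha=1}^{3} \langle G_{p\alpha} G_{pj} \rangle\, r_{p\alpha}\, r_{pi}
 - \sum_{\beta=1}^{3} \langle G_{pi} G_{p\beta} \rangle\, r_{p\beta}\, r_{pj} \\
&\quad + \Bigl( \sum_{\alpha,\beta=1}^{3} \langle G_{p\alpha} G_{p\beta} \rangle\, r_{p\alpha}\, r_{p\beta} \Bigr) r_{pi}\, r_{pj} \\
&= P_p\, (\delta_{ij} - r_{pi}\, r_{pj}).
\end{aligned} \qquad (9)$$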
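The second-moment result for the projected vector can be checked numerically. The sketch below is only an illustration: it assumes NumPy, draws the isotropic 3-vector from a Gaussian with variance $P$ per component (a convenient choice, since the argument in the text needs only isotropy), and uses a hypothetical function name.

```python
import numpy as np

rng = np.random.default_rng(0)

def projected_second_moment(r_hat, P=1.0, n_samples=200_000):
    """Monte Carlo estimate of <g_i g_j> for g = G - (G . r_hat) r_hat."""
    # Isotropic 3-vectors with <G_i G_j> = P delta_ij (Gaussian by assumption).
    G = rng.normal(scale=np.sqrt(P), size=(n_samples, 3))
    # Remove the radial component, leaving the part in the tangent plane.
    g = G - (G @ r_hat)[:, None] * r_hat
    return g.T @ g / n_samples

r_hat = np.array([1.0, 2.0, 2.0]) / 3.0          # an arbitrary unit vector
empirical = projected_second_moment(r_hat, P=1.0)
expected = np.eye(3) - np.outer(r_hat, r_hat)    # P (delta_ij - r_i r_j) with P = 1
print(np.abs(empirical - expected).max())        # small, limited by sampling noise
```

The agreement improves as the number of samples grows, as expected for a Monte Carlo average.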

Combining this with equations (7) and (6), we find that

$$\delta_{ij} = \sum_{p=1}^{N_b} w_p P_p\, Q_{pij}, \qquad (10)$$

with

$$Q_{pij} = \delta_{ij} - r_{pi}\, r_{pj}. \qquad (11)$$

Since equation (10) is symmetric, it gives six constraints on
the $N_b$ pixel weights $w_p$. Of course, $N_b \gg 6$, so the choice
of weights is still heavily underdetermined, and we need an
additional criterion to fix them. We choose to adopt the
simple and reasonable criterion that the weights should be
as nearly equal as possible, subject to (10). That suggests
minimizing the variance of $\{w_p\}$. In fact, it is somewhat
more convenient to minimize the variance of
$\tilde{w}_p \equiv w_p P_p$, so let us do that instead.†
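We want to minimize

$$\mathrm{Var}(\tilde{w}) = \frac{1}{N_b} \sum_{p=1}^{N_b} \tilde{w}_p^2 - \Bigl( \frac{1}{N_b} \sum_{p=1}^{N_b} \tilde{w}_p \Bigr)^2. \qquad (12)$$

Taking the trace of equation (10), we find that $\sum_p \tilde{w}_p = \sum_p w_p P_p = \frac{3}{2}$,
so the second term in the above equation is constant. (This
is the reason it's simpler to work with $\tilde{w}_p$ than with $w_p$.)
Hence, all we have to do is minimize

$$\frac{1}{2} \sum_{p=1}^{N_b} \tilde{w}_p^2 \qquad (13)$$

subject to the constraints (10). Introducing a symmetric $3 \times 3$
matrix $\Lambda$ of Lagrange multipliers, this minimization problem
becomes

$$\tilde{w}_p = \sum_{i,j=1}^{3} \Lambda_{ij}\, Q_{pij}. \qquad (14)$$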

Plugging this back into the constraint equation (10), we get 



† Why $\tilde{w}_p$ instead of $w_p$? There are three reasons: (1) as we shall
see, the mathematics is slightly simpler; (2) $P_p$ is proportional to
the mean-square value of $\mathbf{g}_p$, which contains contributions from
both signal and noise - typically, any variation of $P_p$ from pixel
to pixel will be due to noise variation, so large values of $P_p$ will
correspond to noisy patches, and it can't hurt to down-weight
noisy patches a little; (3) as we shall see below, we are going
to do quite a bit of smoothing before estimating the gradient,

$$\delta_{ij} = \sum_{k,l=1}^{3} \Lambda_{kl}\, \mathcal{Q}_{ijkl}, \qquad (15)$$

where

$$\mathcal{Q}_{ijkl} = \sum_{p=1}^{N_b} Q_{pij}\, Q_{pkl}. \qquad (16)$$




Equations (15) form a 6-dimensional linear system that is
easily solved for $\Lambda$. Once $\Lambda$ is known, equation (14) yields
the weights $\tilde{w}_p$, and hence $w_p = \tilde{w}_p / P_p$.
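The weight calculation can be sketched in a few lines. This is an illustrative implementation under stated assumptions, not the code used for this paper: it assumes NumPy, the function name `block_weights` is hypothetical, and the symmetric tensors are flattened to their six independent components so that the linear system is explicitly 6-dimensional. The multipliers solved for below therefore absorb a factor of two on the off-diagonal components of the multiplier matrix.

```python
import numpy as np

# Independent components of a symmetric 3x3 tensor: three diagonal, three off-diagonal.
PAIRS = [(0, 0), (1, 1), (2, 2), (0, 1), (0, 2), (1, 2)]

def block_weights(r_hats, P):
    """Minimum-variance weights satisfying sum_p w_p P_p Q_pij = delta_ij.

    r_hats : (N_b, 3) unit vectors towards the block centroids
    P      : (N_b,) values of P_p for each block
    Returns (w, w_tilde) with w_tilde = w * P.
    """
    # Q_pij = delta_ij - r_pi r_pj for every block.
    Q = np.eye(3) - r_hats[:, :, None] * r_hats[:, None, :]
    q = np.stack([Q[:, i, j] for i, j in PAIRS], axis=1)   # shape (N_b, 6)
    target = np.array([1.0, 1.0, 1.0, 0.0, 0.0, 0.0])      # delta_ij, flattened
    gram = q.T @ q                                         # 6x6 sum over blocks of Q_p Q_p
    mu = np.linalg.solve(gram, target)                     # flattened Lagrange multipliers
    w_tilde = q @ mu                                       # weights linear in Q_p
    return w_tilde / P, w_tilde
```

With a cut sky the solution generally contains both positive and negative weights, and the weights sum to 3/2 when multiplied by $P_p$, as required by the trace of the constraint.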

Once we have chosen the weights, obtaining the extrema
of $f$, and hence the statistic $\mathcal{D}$, is extremely rapid. Let $\hat{n}$ be a
unit vector that points in the direction of one of the extrema.
To find $\hat{n}$, we must extremise $f$ subject to the constraint
that $\hat{n}$ have magnitude one. This constrained extremisation
problem is solved by introducing a Lagrange multiplier: for
each of the three Cartesian components $n_i$ of $\hat{n}$, we set the
derivative of $f$ with respect to $n_i$ equal to the derivative
of the constraint equation $\sum_i n_i^2 = 1$, multiplied by the
Lagrange multiplier $\lambda$:

$$2 \sum_{j=1}^{3} A_{ij}\, n_j = \lambda\, (2 n_i). \qquad (17)$$

This set of three equations is equivalent to the matrix equa-
tion

$$A \hat{n} = \lambda \hat{n}. \qquad (18)$$

In other words, the locations of the extrema of $f$ are eigen-
vectors of $A$. Furthermore, the extreme values themselves
are given by the eigenvalues.

The $3 \times 3$ symmetric matrix $A$ always has three eigen-
vectors, so $f$ has three critical points, one maximum, one
minimum, and one saddle point. Once the elements of $A$
have been computed (requiring $O(N_p)$ operations), all we
have to do to find $\mathcal{D}$ is solve a $3 \times 3$ matrix eigenvalue prob-
lem and take the ratio of the largest and smallest eigenval-
ues.
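The eigenvalue route to the statistic amounts to only a few lines of code. The following is a minimal sketch, assuming NumPy and a hypothetical function name; it also assumes that the smallest eigenvalue is positive, which is not guaranteed in general when some weights are negative.

```python
import numpy as np

def directionality(grads, weights):
    """Return (D, n_max, n_min) from the eigen-decomposition of A.

    grads   : (N_b, 3) gradient vectors g_p
    weights : (N_b,) weights w_p
    """
    # A_ij = sum_p w_p g_pi g_pj
    A = np.einsum('p,pi,pj->ij', weights, grads, grads)
    # eigh handles symmetric matrices and returns eigenvalues in ascending order.
    eigvals, eigvecs = np.linalg.eigh(A)
    # Ratio of largest to smallest eigenvalue (assumes eigvals[0] > 0).
    return eigvals[-1] / eigvals[0], eigvecs[:, -1], eigvecs[:, 0]
```

The two returned vectors are the directions in which the quadratic form is largest and smallest; each is defined only up to an overall sign.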



3 COMPARISON WITH OTHER METHODS 

We can now search the COBE DMR data set for specific di- 
rectionality and compare our results with other approaches. 
We adopt as our basic data set a weighted average of the 53 
and 90 GHz A and B sky maps, in ecliptic pixelisation, with 
pixels near the Galactic plane removed following the COBE 
DMR group's 'custom cut' (Bennett et al. 1996). 

A best-fitting monopole and dipole are removed from
the data set at the beginning of the analysis. Dipole removal
is absolutely crucial in nearly all large-angle CMB analysis
(Tegmark & Bunn 1995), but it is more than usually impor-
tant in this case, since the statistic $\mathcal{D}$ is extremely sensitive
to the dipole. (If we didn't remove the dipole, the function
$f$ would have an enormous peak oriented along the dipole
axis.) Fortunately, least-squares fitting removes all trace of
the dipole: two maps that differ only by a dipole will look
identical after least-squares dipole removal.
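Least-squares monopole and dipole removal on a cut sky can be sketched as below. This is an unweighted illustration assuming NumPy and a hypothetical function name; a real analysis might weight pixels by their noise levels, which is not shown here.

```python
import numpy as np

def remove_monopole_dipole(pix_dirs, temps):
    """Subtract the best-fitting monopole and dipole from the retained pixels.

    pix_dirs : (N, 3) unit vectors towards the retained pixels
    temps    : (N,) pixel temperatures
    """
    # A monopole is a constant; a dipole is linear in the Cartesian components.
    design = np.column_stack([np.ones(len(temps)), pix_dirs])
    coeffs, *_ = np.linalg.lstsq(design, temps, rcond=None)
    return temps - design @ coeffs
```

Because the fit is a linear projection, adding any monopole or dipole to the input leaves the residual map unchanged, which is the property relied upon in the text.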

In order to compute a smoothed gradient map, we must 

Figure 1. How pixels are grouped into blocks. The dots represent
some of the pixels on one face of the COBE sky cube. The four 
boxes show the four overlapping blocks to which the 16 central 
pixels (represented by the solid circles) belong. In this manner, all 
pixels belong to multiple blocks. Any blocks that extend off the 
edges of the face continue to adjacent faces. In addition, there are 
eight blocks that cover the corners of the sky cube — each of these 
eight blocks has only 48 pixels, rather than 64, since it consists 
of one 4x4 'sub-block' on each of the three cube faces that meet 
at that corner. 



the DMR maps, in which the sky is divided into six faces,
with each face subdivided into a $2^5 \times 2^5$ square of pixels,
suggests a natural way of grouping pixels: we subdivide each
face into square blocks with $2^n$ pixels on a side. The appro-
priate value of $n$ (i.e., the best smoothing level) is found by
Monte Carlo simulations: $n$ should be chosen to maximise
the probability with which maps that contain a preferred
direction can be distinguished from those that do not.‡ The
simulations, which will be described in more detail below,
lead to the conclusion that maximum sensitivity is typically
achieved when $n = 3$, meaning that each gradient estimate
is computed by fitting a linear function to a square block of
64 pixels. The blocks need to be large because the COBE
signal-to-noise ratio is relatively low.

With such large blocks, we have relatively few gradient
estimates to work with. In order to increase the number of
data points, we allow the blocks to overlap, so that each
pixel is contained in four blocks as shown in Fig. 1.§ With
this arrangement, there are 378 blocks in a full-sky map, but



‡ In order to avoid biasing the final results, it is essential that
choices like this one be made solely on the basis of simulated
maps, without looking at the real data.
§ Naturally, this scheme induces correlations between different

Figure 2. Pixel weights. These are our calculated weights for
blocks of pixels in the COBE map, using the 'custom cut' of the 
Galactic plane. 



only 199 remain after the Galactic cut is applied (we choose
to reject a block if the cut removes more than 20 per cent
of its pixels).

Now that we know the pixel locations, we can compute
the weights $w_p$ to be used in the preferred-direction search.
The weights are plotted in Fig. 2 as a function of Galactic
latitude.¶ As expected, the Galactic cut causes pixels near
the poles to be given less weight than those near the equator;
in fact, about 23 per cent of the pixels have negative weight.

In order to assess the efficacy of the $\mathcal{D}$ statistic in dis-
tinguishing models with a preferred direction from those
without, we chose to focus on a specific model with pre- 
ferred direction - we performed Monte Carlo simulations of 
flat toroidal Universes. When all three torus dimensions are 
large, the temperature distribution is, to an excellent ap- 
proximation, an isotropic Gaussian random field, with no 
preferred direction. When one or two dimensions are made 
small, though, a preferred direction emerges. 

The upper panel of Fig. 3 shows the results of some of
these simulations. One histogram shows the distribution of
$\mathcal{D}$ for a standard, isotropic, Gaussian model with an $n = 1$
Sachs-Wolfe power spectrum, $C_\ell \propto 1/[\ell(\ell + 1)]$. The other two
histograms represent models with a preferred direction. The
plot labelled 'T1' is a model that is topologically small in one
dimension and large in the other two. The small dimension
has a length of 0.3 times the comoving distance to the last-
scattering surface. The plot labelled 'T2' is as small as this
in two dimensions. All of these simulations were made with
signal-to-noise ratios that match that of the DMR data.

As expected, the topologically small models do have 



with Monte Carlo simulations that automatically account for 
these correlations. 

¶ In order to compute the weights, we need to know $P_p$, the mean-
square value in the $p$th gradient estimate. These values depend
weakly on the cosmological model chosen. For Fig. 2, the weights
were computed using a standard $n = 1$ Harrison-Zel'dovich spec-




Figure 3. The distribution of the directionality statistic, $\mathcal{D}$.
These histograms show the distribution of the statistic $\mathcal{D}$ in Monte
Carlo simulations. The three cases shown are an isotropic Gaus-
sian model and toroidal models with one and two 'small' dimen-
sions of 0.3 times the distance to the last-scattering surface (la-
belled T1 and T2, respectively). The top panel shows results of
simulations with COBE signal-to-noise levels; the signal-to-noise
in the bottom panel was inflated by a factor of five.



higher values of $\mathcal{D}$ on average. Unfortunately, the distribu-
tion is quite broad, reducing the statistic's ability to dis-
tinguish the isotropic model from the anisotropic ones (al- 
though for much shorter small dimensions the directionality 
would be easily noticed). This is due in large part to the 
poor signal-to-noise ratio in the DMR data, as the lower 
panel of Fig. 3 shows. This figure was produced in the same 
way as the upper panel, except that the signal-to-noise ratio 
was increased by a factor of five. 

Table 1 shows one way of quantifying the power of
this statistic. We created simulated sky maps for both an
isotropic Gaussian model and a set of toroidal models with
one and two small dimensions. We computed the value of $\mathcal{D}$
for all of the realisations. For each anisotropic sky, we then
asked the following question: is the value of $\mathcal{D}$ produced by
this sky large enough to rule out the isotropic model at 95
per cent confidence? In other words, we determined whether

Table 1. Probability (in per cent) of detecting a topologically
small Universe at 95 per cent confidence in Monte Carlo simu-
lations. The models have one or two small directions (T1 and
T2, respectively) with COBE noise, and using either the COBE
signal normalisation, or 5 times that value.

Length   T1       T2       T1          T2
scale    COBE     COBE     5 x COBE    5 x COBE
         norm.    norm.    norm.       norm.

0.3      51.2     75.6     74.9        95.8
0.5      22.4     35.5     32.8        43.7
0.8       8.0      7.2      6.0         9.2



of the time that this occurs is the probability that a pre- 
ferred direction would be detected (i.e., that isotropy would 
be ruled out) at 95 per cent confidence. 

Table 1 shows these detection probabilities for both T1
and T2 models with length scales of 0.3, 0.5, and 0.8 times
the distance to the last-scattering surface, and for signal-
to-noise levels equal to the COBE value and five times the
COBE value. In order to see how well our statistic compared
with others, we implemented the 'S' statistic described by
de Oliveira-Costa et al. (1996) and performed similar sim-
ulations with this statistic. We find that the S statistic is
more powerful than ours: for instance, when the S statistic is
used, the last row of Table 1 changes to 11.4, 20.6, 15.8, 25.0
per cent. However, we should stress that this is precisely the
case for which S was developed; our statistic is much more
general, and we will discuss its utility in the next section.

We have focused primarily on the results of simulations,
since our main interest is in assessing the power of the statis-
tic. However, for completeness we now mention the limits
that can be placed on topologically small Universes by ap-
plying this statistic to the actual DMR data. The value of
$\mathcal{D}$ for the real data is 1.74. The maximum value of $f$ occurs
at Galactic coordinates (l,b) = (84°, −22°), and the mini-
mum is at (343°, −26°). A map of $f$ for the real data is
shown in Fig. 4. For comparison, a map of $f$ for a sky with
a prominent hot spot added is also shown. (The dullness of
these maps is not surprising: since $f$ is a quadratic form, it
cannot have much structure.)

The fact that the directions found for the COBE data
are not particularly close to various 'special' directions in the
sky is reassuring. (Even more reassuring is that the maxima
and minima of $f$ are distributed isotropically over the sky in
our simulations.) The maximum value of $f$ occurs 68° from
the Galactic pole, 84° from the Galactic centre, and 26° from
the CMB dipole (Lineweaver et al. 1999). The minimum
value occurs at 64°, 31°, and 78° from these three directions.
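Separations of this kind are straightforward to reproduce. The sketch below assumes NumPy and hypothetical function names, and it treats each direction as an axis (a direction and its antipode are equivalent, since the quadratic form is even in the direction vector), so that separations never exceed 90°. Only the Galactic pole and centre are used here, because their coordinates follow from the definition of the coordinate system.

```python
import numpy as np

def gal_to_vec(l_deg, b_deg):
    """Unit vector for Galactic longitude l and latitude b, both in degrees."""
    l, b = np.radians([l_deg, b_deg])
    return np.array([np.cos(b) * np.cos(l), np.cos(b) * np.sin(l), np.sin(b)])

def axis_separation_deg(v1, v2):
    """Angle in degrees between two axes, folding antipodal directions together."""
    c = abs(np.dot(v1, v2))
    return np.degrees(np.arccos(np.clip(c, 0.0, 1.0)))

n_max = gal_to_vec(84.0, -22.0)     # quoted location of the maximum
n_min = gal_to_vec(343.0, -26.0)    # quoted location of the minimum
pole = gal_to_vec(0.0, 90.0)
centre = gal_to_vec(0.0, 0.0)
for name, n in [("max", n_max), ("min", n_min)]:
    print(name, axis_separation_deg(n, pole), axis_separation_deg(n, centre))
```

The printed separations can be compared with the values quoted in the text for the pole and the centre.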

The directions picked out by our statistic do not lie 
especially close to prominent hot or cold spots in the data. 
The three most significant spots in the DMR data, as listed 
by Cayon & Smoot (1995), are all at least 35° from the 
locations of both the maximum and minimum values of $f$.

Bromley & Tegmark (1999) have identified another po-
tentially significant direction in the COBE data. They point
out that a recent claimed detection of non-Gaussianity in
COBE (Ferreira et al. 1998) can be made to disappear if
pixels near (l,b) = (257°, 39.5°) are excised from the data.
This direction is about 18° from the maximum value of $f$.
While this is a closer match than any of the other directional
comparisons made above, it is not close enough to be statis-
tically significant, and we suspect it is a mere coincidence.‖

We can use the value of $\mathcal{D}$ for the real data to rule out
various models in the usual way: if a model produces $\mathcal{D}$-
values larger than the real data 95 per cent of the time in
simulations, then it is ruled out at 95 per cent confidence.
For a T2 model, we can rule out a length scale of 0.3 (in
units of the comoving distance to the last-scattering sur-
face) for the small dimensions at 95 per cent confidence. For
T1 models, the limits are somewhat weaker: there does not
appear to be a 95 per cent confidence limit, but a length
scale of 0.2 is ruled out at 90 per cent confidence. As we
might expect in light of the comments above, these limits
are weaker than those set by other more focused techniques
such as the S statistic. But they are quite general, and could
also be used to constrain any other family of models which
possess directionality.



4 UTILITY 

Since the next generation of satellite experiments will pro-
duce extremely large (mega-pixel) data sets, it is important
to devise statistical techniques that can be rapidly and ef-
ficiently calculated. This is an especially important concern
in cases where the statistic has to be calibrated with Monte
Carlo simulations. The statistic $\mathcal{D}$ described in this paper
can be evaluated for an $N_p$-pixel map with $O(N_p)$ calcula-
tions, which means that extensive Monte Carlo simulations
will be possible even with mega-pixel maps.

The slowest step in the calculation of $\mathcal{D}$ is estimating the
gradient map. Suppose the map has $N_p$ pixels divided into
$N_b$ blocks, so that there are $M = N_p/N_b$ pixels per block.
Then each gradient estimate involves fitting a linear func-
tion to $M$ data points, at a cost of $O(M^2)$ operations. The
total operations count is therefore $O(M^2 N_b)$, or $O(M N_p)$,
with a prefactor of order unity. The appropriate value of $M$
in future experiments is unlikely to be much greater than
the value $M = 64$ used above: indeed, it may prove to be
smaller, since future maps will have lower pixel noise than
COBE. It should therefore be possible to estimate $\mathcal{D}$ with an
operation count of order $100 N_p$. Even for data sets as large
as those expected from MAP and Planck, it will be possible
to calculate the statistic, and calibrate it with simulations,
quite rapidly.

This statistic may also prove useful for sky maps at 
other wavelengths, such as x-ray and infrared maps. It can 
also be generalised to apply to maps of point sources by 
pixelizing the map in pixels large enough to contain many 
sources each. Thus it could be applied to galaxy or star 
catalogues which cover a sufficiently large solid angle. 

The technique described in this paper can provide a 
valuable test for Galactic or zodiacal contamination in sky 



‖ In the past three paragraphs, we have compared the location of
the maximum and minimum of $f$ to seven locations on the sky. If

be searched for, in order to constrain particular models, but
there will also be general tests for patterns on the sky or 
general sorts of non-Gaussianity which might be present. 
Among those basic properties to search for, directionality 
is quite fundamental, and could be connected with various 
models for the structure of the Universe on the largest scales. 
We have discussed a general method for finding directional-
ity, the $\mathcal{D}$ statistic, which requires no particular assumptions
about the specific form of the directional pattern. This is a 
rather obvious statistic, being quadratic in the gradient of 
the temperature field, and with the only complication being 
the derivation of the appropriate weights to use when the 
map is not all-sky. 

For polarisation maps, the application of the $\mathcal{D}$ statistic
is even more straightforward, because the gradient does not
have to be taken. And since the statistic is quite general, it
could obviously be applied to maps at other wavelengths. For
large-angle catalogues of discrete objects (e.g. x-ray clusters
or radio galaxies) this statistic could also be utilised, after
first extracting a smooth field from the catalogue. As well as
providing constraints on tera-parsec scale exotica, there is a
much more mundane use for the $\mathcal{D}$ statistic, namely testing
for residual foreground or other systematic effects. For large
maps our statistic will be particularly easy to apply, since it
is straightforward and fast to calculate, a great advantage
given the sizes of the data sets which are about to deluge us.



Figure 4. Maps of $f/f_{\max}$ in Aitoff-projected Galactic coordi-
nates. The Galactic centre is at the centre of the maps; Galactic
longitude increases from right to left. The real DMR data were
used to produce the top panel; for the bottom panel, a large
prominent hot spot, with an amplitude of 200 μK and a 20° ra-
dius, centred at (l,b) = (45°, 45°), was added to the data. The
grey-scale runs from zero to one, so the low level of contrast
in the top panel illustrates that $\mathcal{D}$ is not very large (i.e., that
$f_{\min}/f_{\max} = 1/\mathcal{D}$ is not very small).



maps. Suppose we calculate $\mathcal{D}$ for a map from which such
signals are supposed to have been removed. If $\mathcal{D}$ proves to be
large, and if the maximum or minimum value of $f$ occurs in a
direction close to the Galactic or ecliptic pole, then we would
have good reason for suspecting residual contamination.

The $\mathcal{D}$ statistic can also naturally be applied to polar-
isation maps. In fact, the application to a polarisation map
is even simpler than the technique described in this paper,
since one can simply skip the initial gradient estimation and
use the polarisation measurements themselves in place of
the gradient estimates. (Note that the function $f$ depends
quadratically on $\mathbf{g}_p$, so the fact that a polarisation vector is
defined only up to a sign does not matter.) Since the cos-
mic polarisation signals for MAP and Planck are expected
to be rather weak, and the galactic signals fairly uncertain
on large scales, the $\mathcal{D}$ statistic may prove useful in checking
future polarisation maps for residual galactic contamination.
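The remark about the sign ambiguity of polarisation vectors is easy to verify numerically. The sketch below assumes NumPy and uses random 3-vectors and weights as stand-ins for real polarisation measurements (for this check the vectors need not be tangent to the sphere); the function name is hypothetical.

```python
import numpy as np

def quadratic_form_matrix(vectors, weights):
    """Matrix with elements sum_p w_p v_pi v_pj, built from arbitrary vectors v_p."""
    return np.einsum('p,pi,pj->ij', weights, vectors, vectors)

rng = np.random.default_rng(2)
pol = rng.normal(size=(50, 3))                 # stand-in polarisation vectors
weights = rng.uniform(0.5, 1.5, size=50)       # stand-in weights
signs = rng.choice([-1.0, 1.0], size=50)       # arbitrary sign flip for each vector

A_original = quadratic_form_matrix(pol, weights)
A_flipped = quadratic_form_matrix(pol * signs[:, None], weights)
print(np.allclose(A_original, A_flipped))
```

Since each vector enters only through its outer product with itself, flipping any subset of signs leaves the matrix, and therefore its eigenvalues, unchanged.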



5 CONCLUSIONS 

When the MAP and Planck satellites return their high- 